Executable Code Actions Elicit Better LLM Agents
https://arxiv.org/abs/2402.01030
This work proposes to use executable Python code to consolidate LLM agents' actions into a unified action space (CodeAct). (Abstract)
テキストやJSONで出力する代わりに実行可能なPythonコード
CodeActAgent, finetuned from Llama2 and Mistral, is integrated with Python interpreter and uniquely tailored to perform sophisticated tasks (e.g., model training) using existing libraries and autonomously self-debug.
https://github.com/xingyaoww/code-act
We implemented a containerized code execution engine based on JupyterKernelGateway.
https://github.com/xingyaoww/code-act/blob/main/scripts/chat/code_execution/start_jupyter_server.sh
https://github.com/xingyaoww/code-act/blob/main/scripts/chat/code_execution/api.py
jupyter-kernel-gateway
JupyterKernelなどを実装している! https://github.com/xingyaoww/code-act/blob/main/scripts/chat/code_execution/jupyter.py#L55
2. CodeAct Makes LLMs Better Agents(Figure 1)